1 

Capacity with energy constraint in coherent state 

channel 

Masahito Hayashi 
Abstract 

We consider two kind of energy constraints wiien the output state is a coherent state. One is a constraint on the total energy 
during a fixed period; the other is a constraint on the total energy for a single code. The first setting can be easily dealt with by 
using the conventional capacity formula. The second setting requires the general capacity formula for a classical-quantum channel. 

Index Terms 

Energy constraint. Coherent state, Information spectrum, average error probability, pulse 

I. Introduction 

Recently, the demand for high speed optical communication has been steadily increasing. Optical communication is a 
communication technology sending information via transmitting photons through optical fibers. Optical communication requires 
energy because photons have energy, which increases in proportion to the average photon number. For reasons of economy, 
the average photon number is restricted to a fixed value. 

In conventional information theory, including quantum information theory, it has been usual to deal with the capacity, which 
is the maximum transmission rate with an energy constraint. In this formulation, the average photon number per single pulse 
is restricted. That is, the total average photon number is allowed to increase in proportion to the number of pulses in a single 
code. However, in real optical communication, the total average photon number over a fixed period, rather than the average 
photon number per pulse, should be restricted to a fixed value. 

Since coherent light is normally used for optical communication, it is natural to assume that only coherent states are available 
for the signal states. The attenuation channel is often assumed as the quantum communication channel. In this case, the output 
state of the coherent input state is also a coherent state. Hence, we only treat the case where the output state is a coherent 
state. 

In the present paper, with respect to the first setting, the amount of transmitted information when the total average photon 
number E during the fixed period is fixed and the number K of pulses during the fixed period is increasing can be changed. 

As for the second setting, the amount of transmitted information is treated when the total average photon number E for a 
single code is fixed and the number N of pulses for a single code is increasing. In this framework, the relation between the 
amount of transmitted information and the average error probability in the asymptotic setting is discussed. Since the restriction 
for a single pulse depends on the number N of pulses, we cannot apply the conventional capacity formula for the stationary 
memoryless channel. In order to resolve this problem, we apply the asymptotic general capacity formula for a classical-quantum 
channel, which was invented by Hayashi-Nagaoka[5] as the quantum version of Verdu-Han[4]'s general capacity formula. In 
this formula, the quantum information spectrum plays an essential role. 

In the third formulation, we evaluate the average eiTor probability based only on the average photon number C of coherent 
light and the amount N of transmission information in the non-asymptotic setting. In this discussion, Holevo's covariant 
measurement plays an essential role. We derive a general relation between the average photon number and the average error 
probability in a general framework. Some results in the second setting are recovered from this non-asymptotic formula. 
Therefore, we discuss the required average photon number for reliable communication based on coherent light from several 
viewpoints. 

The remainder of the present paper is set out as follows. In section [III we deal with the first setting, i.e., the amount of 
transmitted information when the total average photon number C during the fixed period is fixed and the number K of pulses 
during the fixed period is increasing. In section |III1 we consider the second and third settings, i.e.. Theorem [T] is presented (in 
the second setting) and Theorem |2] (in the third setting) is presented as the main results. In section |IV] we revisit the general 
capacity formula for a classical-quantum channel for the second setting and derive a useful general formula for the case when 
the output states are pure. This general formula is proven in the Appendix. In section [V] a proof of Theorem [T] is given based 
on the discussion in section |IV] In section [Vl] a proof of Theorem |2] is given based on group representation theory. In the 
appendix, an important theorem stated in section |IV] is proved. 
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II. Photon number constraint for a fixed period 

In this paper, we treat the A^-fold tensor product system 7i®^ of the Boson-Fock space TC, which is spanned by the number 
States |0), . . . , . . .. Coherent light with the complex amplitude a is given as |a) := e ~ J2'^=o "^l*^)' 

In coding theory, the classical input symbols are called input alphabets, and in our setting, they equal thie complex plane 
C. Then, the classical-quantum channel discussed here is given as a map from C to the set of density operators on H 
of the form a i-^ \a){a\. The A^-fold memoryless extension is given as a map from to the set of density matrices 
on the A^-th tensor product system 7i^^. That is, this extension maps the input sequence a = (ai, . . . , ajv) to the state 
\d)(d\ := \ai, . . . ,aN){ai, . . . ,aN\- 

Sending the message {1, . . . , Afjv} requires an encoder and a decoder. The encoder is given as a map (^tv from the set 
of messages {1, . . . , Mn} to the set of alphabets C^, and the decoder is given by a POVM = {Y/^jfil- The triplet 
$Ar :— {AIn,(Pn,Y^) is called a code. Its performance is evaluated by the value |$jv| ■— Mn and the average error 
probability, given by 

Mn ^ 

The code ^n ■= (Mn, fN, Y^) is required to satisfy the average photon number constraint 

{ipN{i)\N\ipN{i)) <NE for -ii (1) 

where the number operator N is given as [rii + . . . + njv)|ni, . . . , nN){ni, . . . , tinI- This condition is equivalent 

to the condition 



N 

|2 



i=0 



where a ^ ipN{i)- 

Then, the channel capacity with the average photon number constraint ([1]) is given by 



The capacity C{E) is calculated as[l] 



C{E) =^ sup I lim inf ^ log | $a, | 



e[^N] ^ 
O holds 



C{E) = max H{p) = {E + l)\og{E + 1) - E\ogE, 

Ti pN=E 

where H{p) = — Trplogp. 

Hence, when the number of pluses during a fixed period is restricted to K and the average photon number per second is 
limited by E, the bound of the possible amount of transmitted information is given as 

^^(|) - m| + 1) log(| + 1) - I log |)- 

When the condition K is fixed, this limit is a finite value. However, if the number K is sufficiently large, this value has the 
following asymptotic expansion: 

KC( — ) = ElogK + E - ElogE + — — , 

which implies that we can increase the number of sending bits by increasing the number K of sending pulses per second with 
the fixed average photon number constraint. 

This type of phenomenon does not occur in the classical Gaussian channel. When the channel noise is specified as a Gaussian 
distribution with variance V, and the average photon number constraint is given as the condition that the input intensity per 
signal is E, then the capacity is 



Cc{E,V) :=ilog(l + |; 



Hence, 



A-C,(f,l') = 4l0g(l + ;^)S»^-|l-i 
Thus, even though the number K of pulses per second increases, the capacity is limited to 
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One might consider that this comparison is inappropriate because the coherent state case is noiseless and only has attenuation 
whereas the classical Gaussian case does include noise. However, the variance in the estimation of the state family {|Q;)(Q;||a G 
C} behaves as 0(-^) asymptotically when N copies of the unknown state are prepared. This behavior coincides with the 
estimation of the unknown expectation parameter of the classical Gaussian distribution with fixed variance. The former 
uncertainty is caused by quantum non-commutativity, and the latter uncertainty is caused by classical noise. Usually, both 
cases have a similar asymptotic behavior, as happens with statistical state inference. However, for the situation here, the cases 
have different asymptotic behaviors. Thus, the difference discussed here can be regarded as a special phenomenon in the case 
of the capacity of the photon number constraint for a fixed period. 



III. Photon number constraint for a single code 

In this section, we consider the relation between the transmission amount and the average photon number from another 
viewpoint. First, we consider the transmission amount when the total average photon number is restricted to the value E. Our 
condition ([T]) is replaced by 

((/.w(z)|iV|(^^(z)) <SforVz. (2) 

Since the number of transmitted bits is of order log N, we define the capacity with respect to the logarithmic order with error 
probability e as 



Cie^m'^ sup (liminf.^°S"*^" 

{*iv}S?=i 



N^oc Ice N 



^ holds 



Theorem 1: The capacity Ci{e,E) is as follows: 



mVe-^— <e[ (3) 

n=0 J 

To prove this result, we cannot apply the conventional formula for a memoryless channel. So, we prepare an information 
spectrum method in Section IIVI and present a proof in Section |V] 

In the above theorem, we consider the transmission rate to logarithmic order with the condition for the average photon 
number and number of pulses in the asymptotic setting. In the following, we consider the transmission size under the average 
photon number constraint without any constraint on the number of pulses in either the non-asymptotic setting or the asymptotic 
setting. In this framework, we can use any number of pulses for information transmission, assuming the average photon number 
constraint. The following theorem holds. 

Theorem 2: When any state \ of a code $ satisfies dU, the inequality 

em > 1 - + + (1 - ^) VT^^) ' (4) 

holds. 

Now, we denote the right hand side of (HJl when |$jv| — by e{E,R). When R goes to infinity, we obtain its asymptotic 
expansion, which depends on the behavior of E — R: 



e 



^ 2Vl - e-^^e-(^+«)/2 - (1 - 2e-^)e-« if ^ - i? 



e{E, R)^l (l + 2e^ - 2^e^(l + e^)) er^ < e 



E 



\fE-R^ A 



When the average photon number E is fixed and the information size R is sufficiently large, the error probability is greater 
than e~^. Thus, the reliable transmission of a large amount of information requires a high average photon number This 
evaluation does not depend on the number of pulses. In other words, in order to keep the error probability smaller than p, we 
need the average photon number to be at least — logp . This observation coincides with that of Theorem [T] Conversely, when 
the average photon number E is large relative to the information transmission size R, the obtained lower bound of the error 
probability rapidly approaches zero. That is, this lower bound does not yield the crucial bound for the error probability. In this 
case, in order to evaluate the error probability more precisely, we need to take account of the number of pulses. 



IV. Information Spectrum Approach to a Pure States Channel 

For a proof of Theorem [T] we cannot apply the conventional capacity formula for the stationary memoryless channel. 
Instead we employ the asymptotic general capacity formula for general sequences of a classical-quantum channel by Hayashi- 
Nagaoka[5]. In the general capacity formula, we focus on the sequence of Hilbert spaces sets of alphabets {X^^'^}, 
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and classical-quantum channels W := {W^^^, where the channel W^^^ maps an alphabet x G A'^^) to the density operator 
wi^' on TL^^\ For any sequence {a^^} satisfying the condition aat — > cx), we define the capacity 



C{e\W)'^= sup (liminfi^^^ lim < e 



7V- 



where $jv expresses a code for the classical-quantum channel VF^^^ For any sequence of probability distributions p(^) on 
we define the information spectrum quantity 

/(e|P, W) := sup (fe limsup / Tr M^i^){VKi^) - e''"''M^^7i, < {)}P'^^\dx) < e 1 , 

where the projection {X > 0} is defined as J2i x >o when the spectral decomposition of X is given as J^i XiEi, and 



Then, we obtain the following formula: 

C{e\W) = sup I ie\P,W), (5) 



p 



where P is a sequence {P^^^} of distributions. This formula can be obtained by combining Theorem 6 of Verdu-Han [4] and 
Lemmas 3 and 4 of Hayashi-Nagaoka[5], as is mentioned in Remarks 8 and 11 in Hayashi-Nagaoka[5]. 

When Wx^^ is a pure state for all x e the quantity I{e\P,W) can be characterized by the information spectrum 

quantity, Wp := {W^!^^}. For a sequence of density operators p := {p*^^'}, we define H+{e\p) by 

H+ie\p) :=sup|5 limsup Trp(^) |zliogp(^) <b\ <e 

Then, when wi^'' is a pure state for all x G X^'^\ the following theorem holds. 
Theorem 3: Assume that Wx^"^ is a pure state for all x G X^'^^ Then, the relation 

i7+(e|Wp) -/(e|P,W^) (6) 

holds for any real number < e < 1. 
The proof of this result will be given in the Appendix. 
Combining (|5]i and Q, we obtain 

C(e|W) = supH+{e\Wp). (7) 
p 

Further, the quantity H^{e\p) satisfies the convexity requirement as follows. 

Theorem 4: For any two sequences pi :— {pf^^} (i = 1, 2) and < t < 1, we define the sequences ps :— {tp"i^^ + (1 — 
i)p2^-'}. When p^^'' is unitarily equivalent to P2 '^hen 

i?+(e|p3)>i?+(e|pi). (8) 
Proof: In order to apply the majorization theory for eigenvalues, we denote the j-th eigenvalue of the matrix X by \j{X), 
listing the eigenvalues in decreasing order. Using Theorem III.4.1 of Bhatia [8], we obtain 

i=i i=i j=i j=i 

which implies dU. ■ 

V. Application of the information spectrum approach 
We apply the formula (|7]) to our problem. Then, we obtain 

Ci{e,E)^supH+{€\(Tp), 
p 

where = log TV, ct^^].) /civ \a){d\P'~^\dd), and the support of p(") is {a G C^||la|p < E). Theorem g] guarantees 
that 

Ci{e,E)= sup H+{e\cTp), (9) 
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where the sequence of distributions 'Pmv ■— V^inv} defined as 

'^inv '■~ {P'^^^ jP*^^-* and this sequence is invariant under any action of U(C^)}. 
For any distribution p(^) e 'Pinv ' there exists a distribution p(^' on [0, \/E\ such that 

where a = rfi and /iAf is the invariant measure on the N — 1-dimensional sphere. In the following, we prove that 

Imiinf Tra(T2,{i— ^ loga^T^, < 4 > E ^" V ^^'^ 

° n=0 

when c is not an integer and m is the maximum integer less than c. The equality holds when the distribution p(^) is the delta 
measure on {VE}. Thus, suppg-p.^ ^ H+{t\(Tp) = sup„{m| Y^^n=o^~^ 1^ — Combining with (|9]l, we obtain 
Next, we prove (fTOl l. The state ct^^) can be written as 

"'[0'V^ln=0 I N-1 j n=0 

where the projection n„^jv and the eigenvalue are defined as n„^ at J2n,\\n\\=n \^){^^\ and := (/[q^vb] e"*"^ l^^''^'' ('^'^)) (-w+i-i-j ■ 
Since the eigenvalue is evaluated by 

{iSiTv log'^pT«) < c} < Er=onn,w- Hence, the probability Tr (7f^l^{^^ log a'^p^l^ < c} is evaluated by 

Tr.S.(i^log.™. <c)<|:J^^^e-'-!-,p'»>(*,. (11) 

where the integer m is the maximum integer less than c. Now, we treat the opposite inequality when c is not an integer and 

N > e^/" (12) 

by considering m + 1 cases: namely the cases (0), (1), . . . , (m). 
Case (0): Assume that the inequality 

Ao,VE] n\ { N-i ) 

holds for all n < m. Then, for n <m. 



— <(f e-'''—P^-^Udr)) ^- 



which implies that -^^^ log A^ < c. Thus, the equality of ( fTTl ) holds. 

Case (n) (n = 1. . . . , m): Assume that the inequality (fTsT i does not hold for the integer n. Note that the inequality (fT3b 
always holds for n = for sufficiently large N. Since x i-^ is a convex function, the average := Jjp P^^\dr) 
satisfies 



n! (i + i^).-^ n! (i + - ^ ^ ^ ^^^+V) 



n—l\n (N+n—l\ 



because 7V"(1 + 2^)" > (^^"7^). That 



IS, 



The eigenvalue corresponding to the vector |0, . . . , 0) is Jj^ e P'-^^dr), which is larger than e because x i— > e 
is convex. Since the condition (fT2] l guarantees that e^*"" > e^^ > we obtain j^^j^y log A^ < c. Thus, 
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Since the right hand side of ( fTTT i is less than 1, 



m „ 2n 1 



Considering all cases (0), (1), ... , (m), we obtain 

„2n 



^J[0,n/:B] n\ log TV l<n<m 

which goes to 0. Therefore, 

liminfTra(,^2,{-i-loga(,^2. <c} = liminf / ^ ^^^(dr) > ^ e"^^, 

where equality holds when p(^) is the delta measure on {a/E}. Therefore, we obtain (fTol i. 

VI. Group Covariant Approach 

First, we consider the case when the state \(p{i)) is given as \fi) := y/p\0) + \/l — p\i) where p — e~^, M = |$| and 
|0), |1), . . . , I A/) are orthogonal to each other We focus on the permutation group Sm whose representation V is given as 

v,m = m), y,(io)) = io) 

for any g g Sm- Any one-dimensional subspace of JCi :—< |0), \S) > is an irreducible space, where \S) :— -:^J2jLo b)- 
The remaining irreducible space IC2 is the orthogonal space of ICi. We now define the states \S') and as 

y^p+(l-p)/M VM 
\/M 1 
' ' VAf - r' ' ^/M 
Note that |S") belongs to /Ci, and belongs to /C2. Then, the state |/,;) can be written as 



Thus, all states belong to the space /C3 /C2, where /C3 is the one-dimensional space spanned by \S'). Since the average 



correct probability jj X]^=i(/il^il/i) is invariant with respect to the action of the permutation group: 



M ' " ' M 

i=i j=i 

we can apply Holevo[7]'s group covariant measurement theory. Thus, our optimization problem can be restricted to an 
optimization problem among the POVM {Yj}j of the following form: 

Yj = \uj){u^\, 



where \uj) = + ^^^^^^I'^j) l^i) ^ vector in IC2- For this restriction, the maximum value of (fi\Yi\fi) is 

realized when \vj) = and the maximum value is 



1 , — - — -— r " M ^/M - 1 

^p+{l-p)/M+^{l-p) 



M V M-1 Vm 



which is equal to + (M — l)p + (1 — — pj . Therefore, we obtain dUl. 

Next, we prove the general case. The state \ip{i)) has the form y/p\eo) + y^l — pjci). However, the vectors |eo), |ei), . . . , |eAf ) 
are not necessarily orthogonal to each other Define the TP-CP map £ from the system spanned by . . . , I/a/)} to the 

system spanned by {|eo), |ei), . . . , |ej\/)} as follows. Here, \fj) is the vector defined above. First, we operate with the unitary 
operator U : U\fj) = {ifiii)) <8) \hj), where the states \hj) are an orthogonal basis on the additional system IC4. Next, we execute 
a partial trace with respect to the additional system IC4. Then, the TP-CP map £ is defined as 

£{p) :=TrK, UpU^. 

Thus, any POVM {Y, } satisfies 

Since {£^{Yj)} satisfies the condition for POVM on the system spanned by {|/i), . . . , I/a/)}, we obtain the inequality (|4]l. 
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VII. Discussion 

The present paper discusses the relations between the average photon number constraint, the amount of transmitted informa- 
tion, and the average error probability. While the second setting is based on an asymptotic framework, the analysis of mutual 
information is not sufficient in this setting and an information spectrum approach is required. 

As is shown in the second and third settings, the average error probability is greater than e^^ when the total average photon 
number is E and the amount of transmitted information is sufficiently large. In particular. Theorem [T] guarantees that the 
minimum error probability e^^ is realized when the amount of the transmitted information is the logarithm of the number of 
transmitted pulses. Indeed, such a code can be constructed as follows. For an arbitrary integer N, we define a code $ with 
the set of messages {!,..., N} as follows. The encoder (p is given as 

= |0)(Or(*-i) ® \a){a\) ® |0)(0|«("-^), 

where |ap — E. The decoder {Yi] is as follows. 

Y, := |0>(0|^(*^i' (g, (/- |0)(0|) (g, |0)(0|^("-^^ 

Then, the error probability is 

i-Mz)|r.|^(*)) = i-e-^. 

The above code is realizable with current technology. This construction suggests that increasing the number of pulses yields 
quantum advantages over a classical Gaussian channel. However, it is not so easy to increase the number of pulses for a fixed 
period in optical communication. Development of a physical scheme to increase the number of pulses is required. Exploring 
such a scheme remains a topic for future study. 
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Appendix 

First, we prove the inequality 

I{e\P,W)>H+ie\Wp). (14) 

For this purpose, we apply the discussion of Theorem 1 in Nagaoka-Hayashi[6] to the hypothesis testing Wx^^ x P''^\dx) 
vs W^p^) X P^'^^dx) on the composite system between the quantum system Ti'^^ and the classical system X^'^K In this 
case, the sequence of sets of projections {{wi^^ — e"""Wp^) < Ojjxexc^AN yields the best test. Choose an arbitrary real 
number bo satisfying 

bo>I{e\P,W). (15) 
Thus, when any sequence of sets of projections {{^x^''}a;eA'(")}A^ satisfies the condition 

liminf — log / TrM^^72,AWpW(dx) > 6o, (16) 



then 



lim sup / TrM^i^)(/-ylW)p(A^)(rf^) > 



Define the projections 



and 



Bb,N ■■= {I - e""'W^|7i, > 0} 

AN) 



— Bh^NWx Bh^N if Bb^N 7^ 
if BbM = 0. 
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Then, the quantity H+{e\Wp) can be expressed as follows. 



H+{e\Wp) :=sup<j6 

b 

= sup { b 

b 

— sup { b 

b 



lim sup Tr M^^^i, (/ - B,,jv) < e 



lim sup / Tr[Wi^)(/ - Bb,N)]P{dx) < e 
lim sup / Tr[W^i^) (/ - Bb,N,cc)]Pidx) < e 



Since Bb^N,x C B^.n, all eigenvalues of Bb^N,xW'p^J,^Bb^N,x are less than e Since Bb^N,x is a rank-one projection or 
zero matrix, Bi,^n,xW^J,^Bi,^n,x < e~°"^wi^'', which implies that 

Thus, Bbg.N x satisfies the condition (fTSI l. Therefore, 



limsup / Tr[VFW(/ - Bb„,jv,x)]P(da:) > e, 



which implies that 60 > Since 60 is an arbitrary real number satisfying ( fTSl ). the relation (O holds. 

Next, we prove the opposite inequality 

I{e\P,W)>H+{e\Wp). 
Let S be an arbitrary real number satisfying that S > 0. Define the vector \(t)N,x) by 

f {H^i^) - > 0} if {W^^^ - e""(^+^)W^(fi, < 0} ^ 



(17) 



,x 



Then, 



That is. 



W^''^ if {W^""^ - e"«(''+^)W'^fi, < 0} = 0. 

''W'^ii^ yields the result that 

N,x\il - Bb,N)\4>N,x) = {'/>N,x\{e""''W^p'!l -I> 0}\(bN,x) < e""'^ . 



Thus, the relation {e°"''M^^'J^i) - / > 0} < e^'^^W^p^},^ yields the result that 



Since B^^n is a projection. 



Thus, 



\\\<t'N,x){(t)N,x\- BbM\4'N.x){4>N,x\Bb,N\\l < '^y {4>N,x\iI ^ Bb,N)\4>N,x) < 2e 2 . 

1 - TrP^W{W^W - e'^"(''+')Ty],fi, < 0} = TrW!^''^\^N^x){^N,x\ < TtW!^""^ BbM^N,x) {^NMBbM + e'^-' 
< Tr W^'^'>Bb^N + e-""^ = 1 - Tr W^^^I ~ Bb,N) + e"''"^ 
Therefore, 

limsup / TrWi^^iw!^^^ - e'^^^+^^W^^^i, < 0}P'^^\dx) > limsup / Trl¥i^)(/ - Bb.N)P'^^\dx), 
which implies that I{e\P, W) — 5 < H+{e\Wp). Since 6 is an arbitrary positive real number, we obtain (1% . 
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